Automatically set EncodingType to url #726

Merged

JordonPhillips merged 2 commits into boto:develop from s3_ls_encoding_type on Nov 30, 2015

Conversation

JordonPhillips
Contributor

Some unicode values can break the XML parser, so we have s3 url encode the keys and decode them on our end.
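The round trip being described can be sketched with the standard library alone (the sample key and the `quote`/`unquote` calls are illustrative, not botocore's actual code path):

```python
from urllib.parse import quote, unquote

# Hypothetical key; '+' and non-ASCII characters are the kind of
# values that can trip up an XML parser.
key = 'a+b/foo'

# With EncodingType=url, S3 percent-encodes the key in its response...
encoded = quote(key)        # 'a%2Bb/foo'

# ...and the client decodes it back before handing it to the caller.
decoded = unquote(encoded)  # 'a+b/foo'

assert decoded == key
```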

# This is needed because we are passing url as the encoding type. Since the
# paginator is based on the key, we need to handle it before it can be
# round tripped.
if 'Contents' in parsed and parsed.get('EncodingType') == 'url':
Contributor

I wonder if we should only automatically decode the URI-encoded values if the EncodingType was not explicitly set by the user? For example, if a user looks at the S3 API, sees that you can set an EncodingType, and sets it, they might reasonably expect the results they receive to stay encoded. In fact, that's probably the behavior they are expecting right now. If we automatically start decoding these, it would be a breaking change.

Contributor Author

There's not really any way to know that though.

Contributor

Yeah, I think we'd have to add support for that in order to make it backwards compatible.

@JordonPhillips JordonPhillips added the pr/needs-review This PR needs a review from a member of the team. label Nov 24, 2015
@@ -176,12 +176,12 @@ def test_can_delete_urlencoded_object(self):
         bucket_contents = self.client.list_objects(
             Bucket=self.bucket_name)['Contents']
         self.assertEqual(len(bucket_contents), 1)
-        self.assertEqual(bucket_contents[0]['Key'], 'a+b/foo')
+        self.assertEqual(bucket_contents[0]['Key'], u'a+b/foo')
Contributor

Why do you need to add the unicode prefix to these? I feel like these should still pass with the unicode prefix removed.

@kyleknap
Copy link
Contributor

Implementation looks fine. It would be nice to have an integration test for this: add a key that would normally break the XML parser, show that it is now handled by auto-populating the encoding type, and also show that the key stays url encoded (not automatically decoded) when the user specifies EncodingType themselves.

self.create_object(key_name)
parsed = self.client.list_objects(Bucket=self.bucket_name)
self.assertEqual(len(parsed['Contents']), 1)
self.assertEqual(parsed['Contents'][0]['Key'], key_name)
Contributor
Could we also do a second call where we specify EncodingType=url and show that it is not being url decoded? You could just add it to this test.
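What that second assertion would check can be sketched offline: with an explicit EncodingType=url, the key should come back still percent-encoded. The expected value below is computed with stdlib `quote()`, not taken from a live S3 response:

```python
from urllib.parse import quote

key_name = 'a+b/foo'

# If the user passes EncodingType='url' themselves, the client should
# leave the response encoded, so the test would expect the quoted form.
expected_encoded_key = quote(key_name)

print(expected_encoded_key)  # a%2Bb/foo
```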

@kyleknap
Copy link
Contributor

Just one more thing, then it should be good to merge. Also make sure you have a corresponding CLI PR so we can merge the two together and avoid the test failures we'd get from merging this PR alone.

@JordonPhillips
Copy link
Contributor Author

👍

@kyleknap
Copy link
Contributor

Thanks 🚢. We can merge this when the CLI PR is made to update the CLI based off of these changes.

@mtdowling
Copy link
Contributor

:shipit:

# paginator is based on the key, we need to handle it before it can be
# round tripped.
if 'Contents' in parsed and parsed.get('EncodingType') == 'url' and \
kwargs['context'].get('EncodingTypeAutoSet') == True:
Member

and context.get('EncodingTypeAutoSet', False):

or just:

and context.get('EncodingTypeAutoSet')
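The point of the suggestion is that `dict.get` already returns `None` for a missing key, which is falsy on its own, so the explicit `== True` comparison adds nothing. A quick sketch:

```python
context = {}

# Missing key: .get returns None, which is already falsy.
assert not context.get('EncodingTypeAutoSet')
assert not (context.get('EncodingTypeAutoSet') == True)

# Once the handler sets the flag, plain truthiness is enough.
context['EncodingTypeAutoSet'] = True
assert context.get('EncodingTypeAutoSet')
```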

@jamesls
Copy link
Member

jamesls commented Nov 25, 2015

:shipit: Just had a few small comments.

JordonPhillips added a commit that referenced this pull request Nov 30, 2015
Automatically set EncodingType to url
@JordonPhillips JordonPhillips merged commit 843a66c into boto:develop Nov 30, 2015
@JordonPhillips JordonPhillips deleted the s3_ls_encoding_type branch March 21, 2016 21:28
tipabu added a commit to tipabu/s3-tests that referenced this pull request Nov 25, 2019
Following boto/botocore#726, list_objects will
automatically include an encoding-type=url query param. However, the
client does not decode all of the response elements properly -- notably,
Prefix would remain encoded.

Hopefully this will be fixed soon-ish (I've got a patch proposed at
boto/botocore#1901) but in the meantime, use the
work-around suggested in boto/boto3#816 of
unregistering the set_list_objects_encoding_type_url handler.

Signed-off-by: Tim Burke <tim.burke@gmail.com>
tipabu added a commit to tipabu/s3-tests that referenced this pull request Apr 4, 2022